1. Introduction

The physical origin of the acceleration of the universe has remained unknown since its discovery
in 1998 using supernova (SN) observations (Riess et al. 1998; Perlmutter et al. 1999). It might
imply that a repulsive ‘dark energy’ component dominates the universe, or that we
need a better understanding of the law of gravity, i.e., general relativity (GR) might need to be
modified on cosmological scales (for a recent review of modified gravity theories, see Clifton et al.
2012). Although dark energy (DE) and modified gravity (MG) can accelerate the universe at the
background level in the same way after the required tuning, the degeneracy can be broken when
cosmic structure formation is investigated.

In this era of precision cosmology, a combination of multiple observational probes including
SN, cosmic microwave background (CMB), and large scale structure (LSS) surveys is key to unveiling
the mystery of the cosmic acceleration (Weinberg et al. 2013). This is because different kinds of
surveys can be highly complementary. For example, weak lensing (WL) and redshift surveys are able
to probe $\gamma(k,z)$, which quantifies the deviation of photon trajectories from geodesics, and
$\mu(k,z)$, the time and spatial variation of Newton's constant, respectively. These are two different
effects predicted by a wide range of MG models, which makes the combination of WL with redshift
surveys robust for GR tests, as well as for dark energy studies.

Given that the CMB and WL surveys of Planck (Planck Collaboration et al. 2014) and the Dark
Energy Survey (DES)^1 are accumulating data, we need large redshift surveys to complement them. The
BOSS spectroscopic survey (Anderson et al. 2014) of SDSS-III is currently the largest redshift
survey worldwide, mapping 10,000 square degrees of sky up to z = 0.7 by tracing 1.5 million
luminous galaxies. It will be succeeded by eBOSS^2, a multi-tracer spectroscopic survey of SDSS-IV,
which will focus on a smaller patch of the sky (7500 square degrees) but go deeper. According
to the forecast, it will achieve 1-2% distance measurements from the baryon acoustic oscillations
(BAO) over 0.6 < z < 2.5. The Square Kilometre Array (SKA)^3 HI galaxy redshift survey can
provide us with accurate redshifts (using the 21cm line) of millions of sources over a wide range of
redshifts, making it an ideal redshift survey for cosmological studies (Bull et al. 2014; Raccanelli
et al. 2014; Bacon et al. 2014; Kitching et al. 2014; Santos et al. 2014; Abdalla et al. 2014).

Traditionally, observational constraints on DE or MG using either current or future data are
performed in a parameterised fashion, i.e., the equation of state of DE, w(z), or the $\mu(k,z)$
and $\gamma(k,z)$ functions quantifying the effect of MG (Zhao et al. 2009b)^4 are parameterised using
assumed functional forms, and then the observational constraints on these parameters are worked
out. Simple as it is, this approach has its drawbacks:

• It may cause theoretical bias: the result largely depends on the functional form used for the
parametrisation, which is chosen a priori. The functional forms are usually chosen for
simplicity, for assumed theoretical consistency, or for both;

• The number of parameters is usually minimised, e.g., the CPL parametrisation (Chevallier
& Polarski 2001; Linder 2003) of w(z) has 2 parameters, while the BZ parametrisation
(Bertschinger & Zukin 2008) for MG has 5 parameters. This can yield a reasonably good
constraint on the reconstructed w(z) or the MG functions even when the data are weak, but it
might underfit the data when the data are excellent.

^1 More details of the Dark Energy Survey are available at http://www.dark-energysurvey.org/

^2 More details of the eBOSS survey are available at http://www.sdss.org/sdss-surveys/eboss/

^3 More details of the SKA survey are available at https://www.skatelescope.org/

^4 There are other ways to parameterise the effect of MG, e.g., see Baker et al. (2013).

In nonparametric methods, by contrast, including the principal component analysis (PCA), it is
the assumptions, rather than the number of parameters, that are minimised, so the theoretical bias
can largely be avoided.
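For concreteness, the two-parameter CPL form cited above is usually written as follows. This is the standard expression, quoted here only for illustration; the symbols $w_0$ and $w_a$ are not used elsewhere in this chapter.

```latex
\begin{equation*}
w(a) = w_0 + w_a\,(1 - a)
\qquad\Longleftrightarrow\qquad
w(z) = w_0 + w_a\,\frac{z}{1+z} .
\end{equation*}
```

With this form, at most two numbers describing w(z) can ever be measured, however good the data are, whereas a binned w(z) lets the data decide how many modes are constrained.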

In this chapter, we use the PCA method to forecast the constraints on w(z) and $\mu(k,z)$ using an
SKA HI galaxy redshift survey.

2. Methodology 

In this section, we employ the standard Fisher matrix technique (Tegmark et al. 1997) to perform
the forecast.

2.1 The Fisher matrix formalism

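For a redshift survey, the Fisher matrix reads (Seo & Eisenstein 2007)^5

$$ F_{ij} = \int_{-1}^{1} \int_{k_{\rm min}}^{k_{\rm max}} \frac{\partial \ln P(k,\mu)}{\partial p_i} \frac{\partial \ln P(k,\mu)}{\partial p_j} V_{\rm eff}(k,\mu) \frac{k^2\,dk\,d\mu}{8\pi^2} \tag{2.1} $$

$$ P(k,\mu) = (b + f\mu^2)^2 P(k) \tag{2.2} $$

$$ V_{\rm eff}(k,\mu) = \left[ \frac{nP(k)(1+\beta\mu^2)^2}{nP(k)(1+\beta\mu^2)^2 + 1} \right]^2 V_{\rm sur} \tag{2.3} $$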

where $P(k,\mu)$ and $V_{\rm eff}$ denote the power spectrum in redshift space and the effective volume
respectively, and $V_{\rm sur}$ is the actual volume of the redshift survey. Here $\mu$ is the cosine of the
angle to the line of sight, not the MG function $\mu(k,z)$ defined in Section 2.3. We have used the
Kaiser formula, i.e., Eq. (2.2), to evaluate $P(k,\mu)$, where P(k) is the linear matter spectrum
calculated using CAMB (Lewis et al. 2000), and b and f are the linear bias and the growth function
respectively. To account for the Finger of God (FoG) effect, we have chosen the FoG scale to be
4 Mpc, which is consistent with simulations.

The Fisher matrix formulae for CMB and WL surveys are elaborated in Zhao et al. (2009b).
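The redshift-survey Fisher matrix above can be evaluated numerically along the following lines. This is a minimal sketch under stated assumptions, not the pipeline used for the forecast: `toy_p_lin` is a hypothetical stand-in for the CAMB spectrum, the parameter vector is reduced to (b, f), $\beta = f/b$ is assumed, and the values of `n_gal` and `v_sur` are placeholders rather than SKA specifications.

```python
import numpy as np

def toy_p_lin(k):
    # Hypothetical smooth stand-in for the CAMB linear spectrum (illustration only).
    return 3.0e4 * (k / 0.02) / (1.0 + (k / 0.02) ** 2) ** 1.5

def ln_p_rsd(k, mu, params):
    # Logarithm of the Kaiser formula, Eq. (2.2), with params = (b, f).
    b, f = params
    return np.log((b + f * mu**2) ** 2 * toy_p_lin(k))

def v_eff(k, mu, params, n_gal, v_sur):
    # Effective volume, Eq. (2.3); beta = f / b is an assumption of this sketch.
    b, f = params
    x = n_gal * toy_p_lin(k) * (1.0 + (f / b) * mu**2) ** 2
    return (x / (x + 1.0)) ** 2 * v_sur

def fisher_one_bin(params, n_gal, v_sur, k_min=1e-3, k_max=0.2, n_nodes=64, step=1e-4):
    """Eq. (2.1) for one redshift bin, by Gauss-Legendre quadrature in k and mu."""
    params = np.asarray(params, dtype=float)
    x, w = np.polynomial.legendre.leggauss(n_nodes)        # nodes and weights on [-1, 1]
    k = 0.5 * (k_max - k_min) * x + 0.5 * (k_max + k_min)  # map nodes to [k_min, k_max]
    w_k = 0.5 * (k_max - k_min) * w
    kk, mm = np.meshgrid(k, x, indexing="ij")
    measure = np.outer(w_k, w) * kk**2 / (8.0 * np.pi**2)
    grads = []
    for i in range(params.size):                           # central differences of ln P
        dp = np.zeros_like(params)
        dp[i] = step
        diff = ln_p_rsd(kk, mm, params + dp) - ln_p_rsd(kk, mm, params - dp)
        grads.append(diff / (2.0 * step))
    weight = v_eff(kk, mm, params, n_gal, v_sur) * measure
    return np.array([[np.sum(gi * gj * weight) for gj in grads] for gi in grads])

# Placeholder survey numbers, chosen only to exercise the function.
fisher = fisher_one_bin([1.0, 0.5], n_gal=1.0e-3, v_sur=1.0e9)
```

In a full forecast, one such matrix would be computed per redshift bin and the results summed.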


2.2 Specifications of future SKA HI surveys 

A future SKA HI redshift survey will trace galaxies at radio wavelengths, and the redshifts
will be measured precisely using the emission lines. In this work, we consider Phase 1 and Phase
2 of the SKA HI surveys (dubbed SKA1 and SKA2 respectively). SKA1 will achieve an RMS flux
sensitivity of roughly 70-100 $\mu$Jy with SKA1-MID or SUR, surveying over 5000 deg$^2$ in 10,000
hours. The expected total number of galaxies in Phase 1 is roughly 5 million at redshift z < 0.5 with
a $5\sigma$ detection. In Phase 2, a 10,000 hour survey over 30,000 deg$^2$ will detect one billion galaxies
at a $10\sigma$ detection level. The expected galaxy distribution and bias for SKA1 and SKA2 are shown in
Fig. 1. For more details of the survey specifications, see Santos et al. (2014). Although SKA1 is not
able to compete with the BOSS survey, SKA2 will surpass any planned spectroscopic survey in
the optical bands at z < 1.4.

^5 Note that this is the Fisher matrix for a given redshift bin. The final Fisher matrix is the sum over the Fisher
matrices of individual redshift bins.

2.3 Cosmological parameters 

To be generic, we parameterise the universe using the parameters

$$ P = \{\Omega_b h^2, \Omega_c h^2, h, \tau, n_s, A_s, w_i, \mu_{ij}, \gamma_{ij}\} \tag{2.4} $$

where $\Omega_b h^2$ and $\Omega_c h^2$ are the energy densities of baryons and cold dark matter respectively, h is the
Hubble constant, $\tau$ is the optical depth, $n_s$ and $A_s$ are the spectral index and the amplitude of the
primordial power spectrum respectively, and w denotes the equation of state of dark energy. In general,
we treat w(z) as an unknown function and determine how many of its degrees of freedom can be
constrained using the PCA method (Huterer & Starkman 2003; Crittenden et al. 2009, 2012; Zhao
et al. 2009a; Asaba et al. 2013; Hall et al. 2013). To do this, we bin w in the late-time universe,
namely, 0 < z < 30, using M + 1 z-bins, and consider the value of w in each bin as an independent
parameter. Since the surveys we consider in this work will not be able to probe z > 3 in detail, we
use M bins linear in z for 0 < z < 3 and a single bin for 3 < z < 30.

The $\mu$'s and $\gamma$'s are modified gravity parameters, and they are defined as follows.

In Newtonian gauge, the linear scalar perturbations to the flat Friedmann-Robertson-Walker
metric read,

$$ ds^2 = -a^2(\eta)\left[(1 + 2\Psi(\vec{x},\eta))\,d\eta^2 - (1 - 2\Phi(\vec{x},\eta))\,d\vec{x}^2\right], \tag{2.5} $$

where $\eta$ is the conformal time and $a(\eta)$ the scale factor. In Fourier space, one can write (Hu &
Sawicki 2007; Bertschinger & Zukin 2008),

$$ k^2\Psi = -\mu(k,a)\,4\pi G a^2 \rho \Delta \tag{2.6} $$

$$ \Phi/\Psi = \gamma(k,a) \tag{2.7} $$

where $\Delta$ is the comoving matter density perturbation. The function $\gamma$ describes anisotropic stresses,
while $\mu$ describes a time- and scale-dependent rescaling of Newton's constant G, as well as the
effects of DE clustering or massive neutrinos. In $\Lambda$CDM, the anisotropic stress due to radiation is
negligible during matter domination, thus $\mu = \gamma = 1$.
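As a short worked step that the text does not spell out, Eqs. (2.6) and (2.7) can be combined to show which combination of the two functions sources the sum of the potentials. Substituting $\Phi = \gamma\Psi$ gives:

```latex
\begin{equation*}
k^2(\Phi + \Psi) = (1 + \gamma)\,k^2\Psi
= -\mu(k,a)\,\bigl[1 + \gamma(k,a)\bigr]\,4\pi G a^2 \rho \Delta .
\end{equation*}
```

Probes that are sensitive to $\Phi + \Psi$ therefore respond to the product $\mu(1+\gamma)$ rather than to either function alone. In the $\Lambda$CDM limit $\mu = \gamma = 1$, the right-hand side reduces to $-8\pi G a^2 \rho \Delta$.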

Figure 1: Upper panel: the expected galaxy distribution for SKA1 and SKA2. Lower panel: the
corresponding bias as a function of redshift.

Similar to w(z), we treat $\mu(k,a)$ and $\gamma(k,a)$ as unknown functions and forecast how well we
can constrain their eigenmodes using PCA. Since they are two-variable functions of both k and
a, we have to pixelise them in the (k,z) plane. We pixelise the late-time and large-scale universe
($0 < z < 30$, $10^{-5} < k < 0.2\,h\,{\rm Mpc}^{-1}$) into M + 1 z-bins and N k-bins, with each of the
(M + 1) × N pixels having independent values of $\mu_{ij}$ and $\gamma_{ij}$. We consider w(z) as another unknown
function, allowing each of the M + 1 z-bins to have an independent value of $w_i$. We use M bins linear
in z for 0 < z < 3 and a single bin for 3 < z < 30. We choose M = N = 20 and have checked that this
pixelisation is fine enough to ensure the convergence of the results. We use logarithmic k-bins on
superhorizon scales and linear k-bins on subhorizon scales, to optimise computational efficiency.
As in Zhao et al. (2009b), we only consider information from scales well described by linear
perturbation theory, which is only a fraction of the (k,z)-volume probed by future surveys. Since the
evolution equations (Zhao et al. 2009b) contain time-derivatives of $\mu(k,z)$, $\gamma(k,z)$ and w(z), we
follow Crittenden et al. (2009) and Zhao et al. (2009a) and use hyperbolic tangent functions to
represent steps in these functions in the z-direction, while steps in the k-direction are left as step
functions. The total number of free parameters in our forecast is therefore (M + 1)(2N + 1) + 17 = 878.
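The redshift binning and tanh smoothing described in this section can be sketched as follows. This is an illustration only: the transition scale `width` is a hypothetical choice that the chapter does not quote, and the function names are invented for this example.

```python
import numpy as np

def z_bin_edges(m=20, z_split=3.0, z_max=30.0):
    """Edges of M linear bins on [0, z_split] plus one wide bin up to z_max."""
    return np.append(np.linspace(0.0, z_split, m + 1), z_max)

def smoothed_bins(z, values, edges, width=0.01):
    """Binned function of z with tanh transitions at the interior bin edges.

    `width` is a hypothetical transition scale, not a value from the chapter.
    """
    z = np.asarray(z, dtype=float)
    values = np.asarray(values, dtype=float)
    out = np.full(z.shape, values[0])
    for edge, jump in zip(edges[1:-1], np.diff(values)):
        # Each interior edge adds a smooth step of height `jump`.
        out = out + 0.5 * jump * (1.0 + np.tanh((z - edge) / width))
    return out

def n_free_parameters(m=20, n=20, n_other=17):
    """(M+1) z-bins, each with 2N mu/gamma pixels and one w bin, plus the other parameters."""
    return (m + 1) * (2 * n + 1) + n_other

edges = z_bin_edges()
w_values = np.full(21, -1.0)   # fiducial w = -1 in every bin
```

Because the steps are smooth, derivatives of the binned function with respect to z stay finite, which is the property the evolution equations require.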

2.4 The principal component analysis (PCA) method 

PCA is a traditional method in data analysis. It identifies the principal components (PCs) of
data by diagonalising the data covariance matrix. In cosmology, PCA has been used
to determine the well-constrained combinations^6 of cosmological parameters, e.g., the binned
equation of state w(z) of dark energy (Huterer & Starkman 2003; Crittenden et al. 2009, 2012) and
the pixelised two-variable functions $\mu(k,z)$ and $\gamma(k,z)$ (Zhao et al. 2009a; Asaba et al. 2013; Hall
et al. 2013), which quantify the deviation from general relativity on cosmological scales.

^6 The PCA method discussed here identifies the best measured linear combinations of cosmological parameters,
although extensions exist, e.g., the kernel PCA method (Schölkopf et al. 1998) can optimise the constraint on nonlinear
combinations of parameters.

Generically, the PCA method can be formulated as follows. Let F be an N × N Fisher information
matrix for a parameter set $P = \{p_1, p_2, \ldots, p_N\}$. We can find the eigenmodes of F by matrix
diagonalisation, namely,

$$ F = W^T \Lambda W, \tag{2.8} $$

where $\Lambda = {\rm diag}(\lambda_1, \lambda_2, \ldots, \lambda_N)$, and W is the transformation matrix relating P to Q, which is a set
of new parameters $Q = \{q_1, q_2, \ldots, q_N\}$. P and Q are related via

$$ Q = WP. \tag{2.9} $$

The matrices $\Lambda$ and W store the eigenvalues and eigenvectors of F: W tells how to map the old
correlated parameters, the p's, to the new orthogonal ones, the q's, and $\Lambda$ quantifies the uncertainty
on the q's, with $\sigma(q_i) = \lambda_i^{-1/2}$. The best measured parameter is the q with the minimal error (the
one corresponding to the maximum entry in matrix $\Lambda$).

For dark energy, the p's are the binned w(z) in redshift z. W helps to locate the ‘sweet-spots’
(the redshifts where the error on w(z) is minimised), and $\Lambda$ quantifies how ‘sweet’ they are (the
size of the errors when measuring these modes). For modified gravity, the p's are the pixelised
functions $\mu(k,z)$ and $\gamma(k,z)$ in the (k,z) plane, and the eigenvectors in this case are 2D surfaces.
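The diagonalisation in Eqs. (2.8) and (2.9) can be sketched with NumPy as below. The 6-parameter Fisher matrix is a hypothetical random example built only to exercise the function, and `fisher_pca` is a name invented for this illustration.

```python
import numpy as np

def fisher_pca(fisher):
    """Return (lam, w, sigma) with fisher = w.T @ diag(lam) @ w, best mode first."""
    lam, vecs = np.linalg.eigh(fisher)   # symmetric input; eigenvectors are columns
    order = np.argsort(lam)[::-1]        # largest eigenvalue (smallest error) first
    lam = lam[order]
    w = vecs[:, order].T                 # rows of W are the eigenmodes, so q = W p
    sigma = 1.0 / np.sqrt(lam)           # forecast 1-sigma error on each q
    return lam, w, sigma

# Hypothetical Fisher matrix, positive definite by construction.
rng = np.random.default_rng(0)
a = rng.normal(size=(12, 6))
toy_fisher = a.T @ a
lam, w, sigma = fisher_pca(toy_fisher)
```

The rows of `w` play the role of the eigenvectors discussed above, and `sigma[0]` is the error on the best measured combination.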

3. Results 

For a given set of parameter values, we use MGCAMB (Zhao et al. 2009b; Hojjati et al.
2011) to compute the observables. We generate numerical derivatives of the observables with respect
to the parameters, and use the specifications for the experiments to compute the Fisher information
matrix, which defines the sensitivity of the experiments to these parameters (see Zhao et al. 2009b
for computational details). Our fiducial values are in all cases $\Lambda$CDM: $\gamma_{ij} = \mu_{ij} = -w_i = 1$ for all
i and j, and the fiducial values of the other parameters are those of Planck.
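The step from numerical derivatives to a Fisher matrix can be illustrated with a generic sketch. It assumes Gaussian errors on a data vector with a parameter-independent covariance, which is a simplification of the survey-specific expressions in Zhao et al. (2009b). The callable `model` is hypothetical: in the actual forecast the observables come from MGCAMB, whose interface is not reproduced here.

```python
import numpy as np

def jacobian_central(model, fiducial, rel_step=1e-3):
    """Central-difference derivatives of the observable vector, one column per parameter."""
    fiducial = np.asarray(fiducial, dtype=float)
    columns = []
    for i in range(fiducial.size):
        h = rel_step * max(abs(fiducial[i]), 1.0)
        up, down = fiducial.copy(), fiducial.copy()
        up[i] += h
        down[i] -= h
        columns.append((model(up) - model(down)) / (2.0 * h))
    return np.stack(columns, axis=1)

def gaussian_fisher(model, fiducial, data_cov):
    """F = J^T C^{-1} J for Gaussian errors with a fixed covariance C (an assumption)."""
    jac = jacobian_central(model, fiducial)
    return jac.T @ np.linalg.solve(data_cov, jac)

# Hypothetical linear "observable" O = A p, used only because its answer is known exactly.
design = np.array([[1.0, 0.0], [1.0, 1.0], [0.0, 2.0]])
cov = np.diag([0.1, 0.2, 0.4])
toy_fisher = gaussian_fisher(lambda p: design @ p, [1.0, 1.0], cov)
```

For a linear model the central difference is exact, so the result should match $A^T C^{-1} A$; for real observables the step size would need a convergence check.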

Besides the SKA HI surveys, we consider the two-point correlations (both auto- and cross-)
between weak lensing shear (WL) and cosmic microwave background (CMB) temperature anisotropy,
plus the CMB E-mode polarisation and its correlation with the CMB temperature. Detailed
descriptions of our assumptions for each measurement are found in Zhao et al. (2009b). WL is sourced by
the sum of the potentials ($\Psi + \Phi$). CMB data probe the Integrated Sachs-Wolfe effect (ISW), which
depends on $\partial(\Phi + \Psi)/\partial\eta$. Thus, measuring WL over multiple redshift bins, along with CMB data,
yields information about the relation between $\Psi$ and $\Phi$ and their response to matter density
fluctuations. For our forecasts, we assume the following probes: Planck (Planck Collaboration et al.
2014) for CMB, and DES for WL.

In the rest of this section, we present the results of our forecast for dark energy and
modified gravity in turn.

3.1 Dark Energy constraints 

In this section, we focus on DE constraints in the framework of general relativity. We therefore
fix the $\mu$ and $\gamma$ pixels to unity, but vary the remaining parameters in Eq. (2.4) simultaneously.
After marginalising over the other parameters, we perform a PCA on the w bins.

The results are shown in Figs. 2 and 3. In Fig. 2 we show the 68% CL forecasted errors on a
subset of the principal components (PCs) for four different data combinations. As shown, at the
level of $\sigma(\alpha_i) < 0.5$, Planck alone can only constrain 1 mode (the distance to the last scattering
surface) and Planck + DES can constrain 2 modes, while adding SKA1 or SKA2 raises this to 3
and 5 eigenmodes respectively. In particular, the best measured modes using SKA1 and SKA2
(combined with Planck and DES) can be determined at the level of $\sigma(\alpha_1) = 0.04$ and $\sigma(\alpha_1) =
0.023$ respectively. This is a significant improvement given that $\sigma(\alpha_1) = 0.25$ (Planck alone) and
$\sigma(\alpha_1) = 0.13$ (Planck + DES).

The eigenvectors for the best constrained modes are shown in Fig. 3. Roughly speaking, the nth
best measured mode has n - 1 nodes, corresponding to the (n - 1)th time derivative of w. Having
SKA helps to determine the higher derivatives of w(z), which is key to probing dark energy dynamics.
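The expansion of w(z) + 1 in eigenmodes and the counting of nodes can be sketched as follows. The cosine basis is only a stand-in with the right qualitative behaviour (the nth row, counting from zero, has n nodes); it is not the set of eigenvectors obtained in the forecast, and the binned w(z) is a hypothetical example.

```python
import numpy as np

def cosine_modes(n_bins):
    """Orthonormal cosine basis (rows), a stand-in for the real eigenvectors e_i(z)."""
    j = np.arange(n_bins) + 0.5
    modes = np.array([np.cos(np.pi * n * j / n_bins) for n in range(n_bins)])
    return modes / np.linalg.norm(modes, axis=1, keepdims=True)

def mode_coefficients(w_bins, modes):
    """alpha_i = sum_j e_i(z_j) [w(z_j) + 1] for orthonormal modes."""
    return modes @ (np.asarray(w_bins, dtype=float) + 1.0)

def count_nodes(mode, tol=1e-12):
    """Number of sign changes of a mode along the redshift bins."""
    signs = np.sign(mode[np.abs(mode) > tol])
    return int(np.sum(signs[1:] != signs[:-1]))

modes = cosine_modes(21)
w_bins = -1.0 + 0.1 * np.linspace(0.0, 1.0, 21)   # hypothetical, mildly evolving w(z)
alpha = mode_coefficients(w_bins, modes)
```

With a complete orthonormal basis, `modes.T @ alpha` recovers w(z) + 1 in every bin, and a model with w = -1 everywhere has all coefficients equal to zero.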



Figure 2: The forecasted 68% CL measurement error on $\alpha_i$, the coefficient of the ith principal component
of w(z) + 1, namely, $w(z) + 1 = \sum_i \alpha_i e_i(z)$, using different data combinations illustrated in the legend. A
weak prior of $\sigma(w(z)) < 1$ was assumed.


3.2 Modified Gravity constraint 


Here we consider the most general case, in which we drop the assumption of general relativity.
We therefore vary all the parameters in Eq. (2.4) simultaneously and focus on the constraints on
$\mu$ and $\gamma$.

Let us study the expected errors on $\mu(k,z)$^7. The error on any $\mu_{ij}$ is large, and the pixels have
highly correlated errors. We take only the $\mu_{ij}$ block of the covariance matrix, thus marginalising
over all other parameters, including the $w_i$ and $\gamma_{ij}$. We invert this block to obtain the Fisher matrix
for our $\mu$ values, $F^{(\mu)}$, and diagonalise by writing $F^{(\mu)} = W^T \Lambda W$. We expect, from existing
data, that variations in $\mu$ larger than $\mathcal{O}(1)$ are unlikely. We enforce this by applying a prior $\lambda > 1$ to
the matrix $F^{(\mu)}$. This procedure does not affect the well-measured modes, but gives a reference point
with respect to which we define poorly constrained modes. Since we compute the full covariance
matrix, then marginalise over all but the parameter(s) of interest, our procedure yields the results
that we would get for $\mu$ if we simultaneously measured w, $\gamma$, and $\mu$. This analysis can be repeated
for $\gamma$ or w.
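The marginalise, invert, add-prior, and diagonalise sequence described above can be sketched as below. The prior is implemented here as a unit-variance Gaussian prior on every pixel, which adds the identity matrix to the block Fisher matrix and so pushes every eigenvalue above 1; this is one reading of the prior, stated as an assumption. The 8-parameter Fisher matrix is a hypothetical random example.

```python
import numpy as np

def marginalised_block_pca(full_fisher, block, prior_sigma=1.0):
    """PCA of one parameter block after marginalising over all other parameters."""
    cov = np.linalg.inv(full_fisher)          # full parameter covariance
    cov_block = cov[np.ix_(block, block)]     # keeping a block marginalises over the rest
    fisher_block = np.linalg.inv(cov_block)   # marginalised Fisher matrix of the block
    # Assumed form of the prior: independent Gaussian of width prior_sigma per pixel.
    fisher_block = fisher_block + np.eye(len(block)) / prior_sigma**2
    lam, vecs = np.linalg.eigh(fisher_block)
    order = np.argsort(lam)[::-1]             # best measured mode first
    return lam[order], vecs[:, order].T

# Hypothetical full Fisher matrix; the first three parameters play the role of the block.
rng = np.random.default_rng(1)
a = rng.normal(size=(20, 8))
full = a.T @ a
lam, w = marginalised_block_pca(full, [0, 1, 2])
```

Adding a multiple of the identity leaves the eigenvectors unchanged and shifts every eigenvalue by the same amount, so modes with eigenvalues much larger than 1 are essentially unaffected.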


^7 We don't show the $\gamma$ constraint here since redshift surveys don't constrain $\gamma$ directly.

Figure 3: The best determined eigenvectors (with errors less than 0.5) of w(z) for different data combinations
shown in the legends. The modes are shown, in order from better constrained to worse, as black solid, red
dashed, blue dash-dot, purple dash-dot-dot and brown short dash-dot curves. The short-dashed green horizontal
line shows $e_i(z) = 0$.


Measurements of WL and CMB probe combinations of $\Phi$ and $\Psi$, so the effects of $\gamma$, which
affects only $\Phi$, are mixed with those of $\mu$, which affects both potentials. This yields a degeneracy
between $\mu$ and $\gamma$. This degeneracy can be broken when SKA is added, since it only measures
$\mu$, through the redshift space distortions (RSD).

From Fig. 4 we see that Planck + DES cannot constrain any modes at the 10% level, but
adding SKA1 constrains 7 modes to this level. SKA2 further increases this
number to 20. In particular, 2 and 7 modes can be constrained at the sub-percent level using SKA1
and SKA2 respectively.

Figure 4: The forecasted 68% CL error on the coefficients of the principal components of $\mu(k,z)$ for different
data combinations shown in the legend.

Fig. 5 shows the three best constrained eigenmodes of $\mu$ for different data combinations. A first
observation is that the modes with more nodes (a node appears when an eigensurface crosses zero)
are less constrained. This is intuitive: noisy modes are less well constrained than smooth ones.
The best modes are mainly functions of k and not z. This is partly because the total observable
volume in the radial (z) direction is limited by the dimming of distant objects and, ultimately, the
fact that structures only exist at relatively low z. It is also related to our considering only linear
perturbations in the analysis, since at small z the observable volume is too small to fit the small
k-modes that are still in the linear regime. Hence, there is more volume available for studying the
spatial distribution of structure than the radial distribution.

For Planck + DES, we see a clear degeneracy in the k and z dependences of the modes. This
is because changing $\mu$ at some point (k,z) should have the same impact on the observables as a
change at a larger scale but later time. Interestingly, this degeneracy between k and z goes away
when SKA is added. This is simply because SKA constrains $\mu$ very well via the RSD effect, which
means that the data can distinguish between variations of $\mu$ in k and in z.

4. Conclusion and Discussions 

In this work we apply the PCA method to investigate the constraints on dark energy and
modified gravity using the future SKA HI redshift surveys, combined with CMB (Planck) and WL
(DES) surveys. The PCA method is ideal for investigating dark energy and modified gravity in a
nonparametric way, which efficiently minimises the theoretical bias stemming from choosing ad hoc
functional forms for unknown functions.

We study dark energy and modified gravity separately. For the dark energy equation of state, we
find that SKA Phase 1 and Phase 2 can constrain 3 and 5 eigenmodes of w(z) respectively. The errors
on the best measured modes can be reduced to 0.04 and 0.023 for SKA1 and SKA2 respectively,
making it possible to probe dark energy dynamics (Zhao et al. 2012). For modified gravity,
SKA1 and SKA2 can constrain 7 and 20 eigenmodes of $\mu(k,z)$ respectively at the
10% sensitivity level. In particular, 2 and 7 modes can be constrained at the sub-percent level using
SKA1 and SKA2 respectively.

Figure 5: Eigensurfaces for the first three best constrained modes of $\mu$ after marginalisation over all other
cosmological parameters. Top row: Planck + DES; Middle: Planck + DES + SKA1; Bottom: Planck + DES
+ SKA2.

Imaging and redshift surveys are highly complementary when constraining cosmological
parameters, especially for modified gravity models (Guzik et al. 2010; Song et al. 2011; Gaztañaga
et al. 2012). The method developed in this work can be directly applied to the future surveys of LSST
(LSST Science Collaboration et al. 2009) and Euclid (Laureijs et al. 2011). For the synergy between
SKA, LSST and Euclid, see Bacon et al. (2014) and Kitching et al. (2014).